Speech energy redistribution for intelligibility improvement in noise based on a perceptual distortion measure

نویسندگان

  • Cees H. Taal
  • Richard C. Hendriks
  • Richard Heusdens
چکیده

A speech pre-processing algorithm is presented that improves the speech intelligibility in noise for the near-end listener. The algorithm improves intelligibility by optimally redistributing the speech energy over time and frequency according to a perceptual distortion measure, which is based on a spectro-temporal auditory model. Since this auditory model takes into account shorttime information, transients will receive more amplification than stationary vowels, which has been shown to be beneficial for intelligibility of speech in noise. The proposed method is compared to unprocessed speech and two reference methods using an intelligibility listening test. Results show that the proposed method leads to significant intelligibility gains while still preserving quality. Although one of the methods used as a reference obtained higher intelligibility gains, this happened at the cost of decreased quality. Matlab code is provided. Crown Copyright © 2013 Published by Elsevier Ltd. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Improved Speech Processing Strategy for Cochlear Implants Based on objective measures for predicting speech intelligibility

The purpose of this study was to improve the speech processing strategy for cochlear implants (CIs) A speech preprocessing algorithm is presented to improve the speech intelligibility in noise. The algorithm improves the intelligibility by optimally redistributing the speech energy over time and frequency for a perceptual distortion measure, the algorithm is more sensitive to transient regions....

متن کامل

A Low-Complexity Spectro-Temporal Distortion Measure for Audio Intelligibility Improvement

A speech pre-processing algorithm is presented to improve the speech intelligibility in noise for the near-end listener. The algorithm improves the intelligibility by optimally redistributing the speech energy over time and frequency for a perceptual distortion measure, which is based on a spectro-temporal auditory model. In contrast to spectral-only models, short-time information is taken into...

متن کامل

Speech Enhancement using Adaptive Data-Based Dictionary Learning

In this paper, a speech enhancement method based on sparse representation of data frames has been presented. Speech enhancement is one of the most applicable areas in different signal processing fields. The objective of a speech enhancement system is improvement of either intelligibility or quality of the speech signals. This process is carried out using the speech signal processing techniques ...

متن کامل

Speech/Noise-Dominant Decision regardless of SNR for Speech Enhancement

In speech enhancement, a decision between speech dominant and noise one is important to reduce noise for increasing intelligibility. This paper presents a speech/noise-dominant decision regardless of SNR. In the proposed decision, the influence of noise is reduced by subtracting the noise component. Therefore, the proposed method decides between the speech dominant and noise one accurately. Fro...

متن کامل

Improving Deep Neural Network Based Speech Enhancement in Low SNR Environments

We propose a joint framework combining speech enhancement (SE) and voice activity detection (VAD) to increase the speech intelligibility in low signal-noise-ratio (SNR) environments. Deep Neural Networks (DNN) have recently been successfully adopted as a regression model in SE. Nonetheless, the performance in harsh environments is not always satisfactory because the noise energy is often domina...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computer Speech & Language

دوره 28  شماره 

صفحات  -

تاریخ انتشار 2014